智能论文笔记

Towards an Optimal Hybrid Algorithm for EV Charging Stations Placement using Quantum Annealing and Genetic Algorithms

Aman Chandra , Jitesh Lalwani , Babita Jajodia

分类：神经与进化计算

2021-11-02

Quantum退火是求解优化问题的启发式，这些问题由于D波系统的成功而言，已经看到了最近使用的浪涌。本文旨在找到一种解决电动车充电器放置（EVCP）问题的良好启发式，这是一个问题，这是一个非常重要的问题，但由于建立电动汽车（EV）充电器以及电动汽车的预期浪涌世界。同样的问题陈述也可以推广到网格中的任何实体的最佳位置，可以探索进一步用途。最后，作者介绍了一种新的启发式组合Quantum退火和遗传算法来解决问题。所提出的混合方法需要将遗传算法播种与量子退换器的结果。我们的实验表明，与我们样本EVCP数据集上的香草Quantum退火相比，该方法将该方法降低42.89％的距离。

translated by 谷歌翻译

OneFormer: One Transformer to Rule Universal Image Segmentation

Jitesh Jain , Jiachen Li , MangTik Chiu , Ali Hassani , Nikita Orlov , Humphrey Shi

分类：计算机视觉

2022-11-10

Universal Image Segmentation is not a new concept. Past attempts to unify image segmentation in the last decades include scene parsing, panoptic segmentation, and, more recently, new panoptic architectures. However, such panoptic architectures do not truly unify image segmentation because they need to be trained individually on the semantic, instance, or panoptic segmentation to achieve the best performance. Ideally, a truly universal framework should be trained only once and achieve SOTA performance across all three image segmentation tasks. To that end, we propose OneFormer, a universal image segmentation framework that unifies segmentation with a multi-task train-once design. We first propose a task-conditioned joint training strategy that enables training on ground truths of each domain (semantic, instance, and panoptic segmentation) within a single multi-task training process. Secondly, we introduce a task token to condition our model on the task at hand, making our model task-dynamic to support multi-task training and inference. Thirdly, we propose using a query-text contrastive loss during training to establish better inter-task and inter-class distinctions. Notably, our single OneFormer model outperforms specialized Mask2Former models across all three segmentation tasks on ADE20k, CityScapes, and COCO, despite the latter being trained on each of the three tasks individually with three times the resources. With new ConvNeXt and DiNAT backbones, we observe even more performance improvement. We believe OneFormer is a significant step towards making image segmentation more universal and accessible. To support further research, we open-source our code and models at https://github.com/SHI-Labs/OneFormer

translated by 谷歌翻译

Self-adversarial Multi-scale Contrastive Learning for Semantic Segmentation of Thermal Facial Images

Jitesh Joshi , Nadia Bianchi-Berthouze , Youngjun Cho

分类：计算机视觉

2022-09-21

在不受约束的环境（例如热环境和遮挡）中对热面部图像的可靠分割是具有挑战性的，因为面部特征缺乏显着性。从此类设置的数据集的有限可用性进一步使训练细分网络变得困难。为了应对挑战，我们提出了自我反向的多尺度对比学习（SAM-CL），作为训练分割网络的通用学习框架。 SAM-CL框架构成SAM-CL损耗函数和热图像增强（TIAUG）作为特定域的增强技术，以根据从受控设置收集的现有数据集模拟无约束的设置。我们使用热面数据库来证明我们的方法的有效性。在现有的分割网络中进行的实验 - UNET，注意力UNET，DEEPLABV3和HRNETV2证明SAM-CL框架的绩效增长一致。此外，我们通过Ubcomfort和Deepbreath数据集进行了定性分析，以讨论我们提出的方法在处理无约束情况方面的作用。

translated by 谷歌翻译

LAMA-Net: Unsupervised Domain Adaptation via Latent Alignment and Manifold Learning for RUL Prediction

Manu Joseph , Varchita Lalwani

分类：机器学习

2022-08-17

预后和健康管理（PHM）是一个新兴领域，由于其带来的好处和效率，它引起了制造业的广泛关注。剩余的使用寿命（RUR）预测是任何PHM系统的核心。最新数据驱动的研究要求大量标记的培训数据可以在有监督的学习范式下培训表现模型之前。在这里，转移学习（TL）和域适应（DA）方法介入并使我们有可能将监督模型概括为具有不同数据分布的其他没有标记数据的其他域。在本文中，我们提出了一种基于编码的模型（变压器），该模型（变压器）具有诱导的瓶颈，使用最大平均差异（MMD）的潜在对齐，并提出了歧管学习，以解决无监督的同质域的问题适应Rul预测。 \ textit {lama-net}使用NASA使用C-Mapss Turbofan引擎数据集验证，并将其与DA的其他最新技术进行了比较。结果表明，所提出的方法提供了一种有希望的方法来在RUL预测中进行域适应。一旦纸张退出审查，将提供代码。

translated by 谷歌翻译

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

Jitesh Jain , Yuqian Zhou , Ning Yu , Humphrey Shi

分类：计算机视觉

2022-08-05

深层图像介绍取得了令人印象深刻的进步，随着图像产生和处理算法的最新进展。我们声称，可以通过生成的结构和纹理更好地判断介入算法的性能。结构是指孔中生成的对象边界或新的几何结构，而纹理是指高频细节，尤其是在结构区域内填充的人造重复模式。我们认为，更好的结构通常是从基于粗糙的GAN的发电机网络中获得的，而如今重复模式可以通过最新的高频快速快速傅立叶卷积层进行更好的建模。在本文中，我们提出了一个新颖的介绍网络，结合了这两种设计的优势。因此，我们的模型具有出色的视觉质量，可以匹配结构生成和使用单个网络重复纹理合成的最新性能。广泛的实验证明了该方法的有效性，我们的结论进一步突出了图像覆盖质量，结构和纹理的两个关键因素，即未来的设计方向。

translated by 谷歌翻译

Machine Learning in Sports: A Case Study on Using Explainable Models for Predicting Outcomes of Volleyball Matches

Abhinav Lalwani , Aman Saraiya , Apoorv Singh , Aditya Jain , Tirtharaj Dash

分类：机器学习 | 人工智能

2022-06-18

机器学习已成为包括运动在内的多个领域的工程设计和决策的组成部分。深度神经网络（DNNS）一直是预测职业体育赛事结果的最新方法。但是，除了对这些体育活动成果进行高度准确的预测外，还必须回答诸如“为什么模型预测A团队会赢得与B队的比赛？”之类的问题？ DNN本质上是本质上的黑框。因此，需要为模型在运动中的预测提供高质量的可解释的解释性解释。本文探讨了两步可解释的人工智能（XAI）方法，以预测巴西排球联盟（Superliga）中比赛的结果。在第一阶段，我们直接使用可解释的基于规则的ML模型，这些模型可以根据布尔规则列的生成（BRCG；提取简单和 - 或分类规则）和逻辑回归（logReg;允许估算）对模型的行为进行全局理解。功能重要性得分）。在第二阶段，我们构建了非线性模型，例如支持向量机（SVM）和深神经网络（DNN），以在排球比赛的结果上获得预测性能。我们使用ProtoDash为每个数据实例构建了“事后”解释，该方法在训练数据集中找到原型，与测试实例最相似，而Shap是一种估计每个功能在模型预测中的贡献的方法。我们使用忠诚度量标准评估了摇摆的解释。我们的结果证明了对模型预测的解释的有效性。

translated by 谷歌翻译

Logical Fallacy Detection

Zhijing Jin , Abhinav Lalwani , Tejas Vaidhya , Xiaoyu Shen , Yiwen Ding , Zhiheng Lyu , Mrinmaya Sachan , Rada Mihalcea , Bernhard Schölkopf

分类：自然语言处理 | 人工智能 | 机器学习

2022-02-28

Reasoning is central to human intelligence. However, fallacious arguments are common, and some exacerbate problems such as spreading misinformation about climate change. In this paper, we propose the task of logical fallacy detection, and provide a new dataset (Logic) of logical fallacies generally found in text, together with an additional challenge set for detecting logical fallacies in climate change claims (LogicClimate). Detecting logical fallacies is a hard problem as the model must understand the underlying logical structure of the argument. We find that existing pretrained large language models perform poorly on this task. In contrast, we show that a simple structure-aware classifier outperforms the best language model by 5.46% on Logic and 4.51% on LogicClimate. We encourage future work to explore this task as (a) it can serve as a new reasoning challenge for language models, and (b) it can have potential applications in tackling the spread of misinformation. Our dataset and code are available at https://github.com/causalNLP/logical-fallacy

translated by 谷歌翻译

SeMask: Semantically Masked Transformers for Semantic Segmentation

Jitesh Jain , Anukriti Singh , Nikita Orlov , Zilong Huang , Jiachen Li , Steven Walton , Humphrey Shi

分类：计算机视觉 | 机器学习

2021-12-23

在图像变压器网络的编码器部分中的FineTuning佩带的骨干网一直是语义分段任务的传统方法。然而，这种方法揭示了图像在编码阶段提供的语义上下文。本文认为将图像的语义信息纳入预磨料的基于分层变换器的骨干，而FineTuning可显着提高性能。为实现这一目标，我们提出了一个简单且有效的框架，在语义关注操作的帮助下将语义信息包含在编码器中。此外，我们在训练期间使用轻量级语义解码器，为每个阶段提供监督对中间语义的先前地图。我们的实验表明，结合语义前导者增强了所建立的分层编码器的性能，随着絮凝物的数量略有增加。我们通过将Sromask集成到Swin-Cransformer的每个变体中提供了经验证明，因为我们的编码器与不同的解码器配对。我们的框架在CudeScapes数据集上实现了ADE20K数据集的新型58.22％的MIOU，并在Miou指标中提高了超过3％的内容。代码和检查点在https://github.com/picsart-ai-research/semask-egation上公开使用。

translated by 谷歌翻译